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Introduction 

Application of Grobner bases theory to designed experiments is one of the main branches 
in a relatively new field in statistics, called computational algebraic statistics. The first 
work in this branch is given by Pistone and Wynn ([20]). in this paper, they presented a 
method to handle fractional factorial designs algebraically by denning design ideals. As 
one of the merits to consider design ideals, confounding relations between the factor effects 
can be generalized naturally from regular to non-regular designs and can be expressed 
concisely by the Grobner bases theory. See [20] or [12] for details. After this work, various 
algebraic techniques based on the Grobner bases theory are applied to the problems of 
designed experiments both by algebraists and statisticians. For example, an indicator 
function defined in [TXJ is a valuable tool to characterize non-regular fractional factorial 
designs. 

On the other hand, there is another main branch in the field of computational algebraic 
statistics. In this branch, a key notion is a Markov basis, which is defined by Diaconis 
and Sturmfels ([7]). In this work, they established a procedure for sampling from discrete 
conditional distributions by constructing a connected Markov chain on a given sample 
space. Since this work many papers are given considering Markov bases for various sta- 
tistical models, especially for the hierarchical models of multi-dimensional contingency 
tables. Intensive results on the structure of Markov bases for various statistical models 
are given in [3]. 
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The arguments in this paper relates to both of the two branches mentioned above. In 
fact, the motivation of this paper is of interest to investigate statistical problems which 
are related to both designed experiments and Markov bases. This paper is based on the 
first works with this motivation, [5] and [3]. In these works, Markov chain Monte Cairo 
methods for testing factor effects are discussed, when observations are discrete and are 
given in the two-level or three-level regular fractional factorial designs. As one of the 
contributions of these works, the relation between the statistical models for the regular 
fractional factorial designs and contingency tables is considered through Markov bases. 
As a consequence, to investigate the Markov bases arising in the problems of designed 
experiments, we can refer to the known results on the corresponding models for the 
contingency tables. For example, we see that the Markov basis for the main effect models 
of the regular 2j^ 2 design given by the defining relation ABD = ACE = I is constructed 
only by the square- free degree 2 elements (|5j). This is because the corresponding model 
in the contingency tables is the conditional independence model in the 2x2x2 table. 
Note that the conditional independence model in the three-way contingency table is an 
example of decomposable models and we know the fact that a minimal Markov basis for 
this class of models can be constructed only by square-free degree 2 elements. See [9] for 
detail. 

In this paper, following the Markov chain Monte Carlo approach in the designed 
experiments by [5] and [3], we give a new results on the correspondence between the 
regular two-level design and the algebraic concept, namely cut ideals defined in [2T] . 
Because the Markov bases are characterized as the generators of well-specified toric ideals 
and are studied not only by statisticians but also by algebraists, it is valuable to connect 
statistical models to known class of toric ideals. In this paper, we give a fundamental fact 
that the generator of cut ideals can be characterized as the Markov bases for the testing 
problems of log-linear models for the two-level regular fractional factorial designs. 

The construction of this paper is as follows. In Section [1] we review Markov chain 
Monte Carlo approach for testing the fitting of the log-linear models when observations 
are discrete random variables. The main results of this paper are given in Section |2j We 
show how to relate cut ideals to the fractional factorial designs and give the main theorem. 
In Section [31 we apply known results on the cut ideals to the regular fractional factorial 
designs. 

1 Markov chain Monte Carlo method for regular two- 
level fractional factorial designs 

In this section we introduce Markov chain Monte Carlo methods for testing the fitting 
of the log-linear models for regular two-level fractional factorial designs with count ob- 
servations. Suppose we have nonnegative integer observations for each run of a regular 
fractional design. For simplicity, we also suppose that the observations are counts of some 
events and only one observation is obtained for each run. This is natural for the settings of 
Poisson sampling scheme, since the set of the totals for each run is the sufficient statistics 
for the parameters. We begin with an example. 
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Table 1: Design and number of defects y for the wave-solder experiment 

Factor y 



Run 


A 


B 


C 


D 


E 


F 


G 


1 


2 


3 


1 


1 


1 


1 


1 


1 


1 


1 


13 


30 


26 


2 


1 


1 


1 


2 


2 


2 


2 


4 


16 


11 


3 


1 


1 


2 


1 


1 


2 


2 


20 


15 


20 


4 


1 


1 


2 


2 


2 


1 


1 


42 


43 


64 


5 


1 


2 


1 


1 


2 


1 


2 


14 


15 


17 


6 


1 


2 


1 


2 


1 


2 


1 


10 


17 


16 


7 


1 


2 


2 


1 


2 


2 


1 


36 


29 


53 


8 


1 


2 


2 


2 


1 


1 


2 


5 


9 


16 


9 


2 


1 


1 


1 


2 


2 


1 


29 





14 


10 


2 


1 


1 


2 


1 


1 


2 


10 


26 


9 


11 


2 


1 


2 


1 


2 


1 


2 


28 


173 


19 


12 


2 


1 


2 


2 


1 


2 


1 


100 


129 


151 


13 


2 


2 


1 


1 


1 


2 


2 


11 


15 


11 


14 


2 


2 


1 


2 


2 


1 


1 


17 


2 


17 


15 


2 


2 


2 


1 


1 


1 


1 


53 


70 


89 


16 


2 


2 


2 


2 


2 


2 


2 


23 


22 
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Example 1.1 (Wave-soldering experiment). Table [T7T1 is a 1/8 fraction of a full factorial 
design (i.e., a 2 7-3 fractional factorial design) defined from the defining relation 

ABDE = ACDF = BCDG = I, (1) 

and response data analyzed in [6] and reanalyzed in [14] . In Table [TTj the observation 
y is the number of defects arising in a wave- soldering process in attaching components 
to an electronic circuit card. In Chapter 7 of [6], he considered seven factors of a wave- 
soldering process: (A) prebake condition, (B) flux density, (C) conveyer speed, (D) preheat 
condition, (E) cooling time, (F) ultrasonic solder agitator and (G) solder temperature, 
each at two levels with three boards from each run being assessed for defects. The aim 
of this experiment is to decide which levels for each factors are desirable to reduce solder 
defects. 

Because we only consider designs with a single observation for each run in this paper, 
we focus on the totals for each run in Table 11.11 We also ignore the second observation 
in run 11, which is an obvious outlier as pointed out in [H]. Therefore the weighted total 
of run 11 is (28 + 19) x 3/2 = 70.5 ~ 71. By replacing 2 by —1 in Table we rewrite 
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k x p design matrix as D, where each element is +1 or —1. Consequently, we have 



D 



/ 


1 1 

\ J- 


1 1 

\ 1 


1 1 


I 1 


I 1 


1 1 
T-L 


+ 1 \ 




/ 




\ 




+1 


+1 


+ 1 


-1 


-1 


-1 


-1 






31 






+1 


+1 


-1 


+ 1 


+ 1 


-1 


-1 






55 






+1 


+1 


-1 


-1 


-1 


+ 1 


+ 1 






149 






+1 


-1 


+ 1 


+ 1 


-1 


+ 1 


-1 






46 






+1 


-1 


+ 1 


-1 


+ 1 


-1 


+ 1 






43 






+1 


-1 


-1 


+ 1 


-1 


-1 


+ 1 






118 






+1 


-1 


-1 


-1 


+ 1 


+ 1 


-1 






30 






-1 


+1 


+ 1 


+ 1 


-1 


-1 


+ 1 


, y = 




43 






-1 


+1 


+ 1 


-1 


+ 1 


+ 1 


-1 






45 






-1 


+1 


-1 


+ 1 


-1 


+ 1 


-1 






71 






-1 


+1 


-1 


-1 


+ 1 


-1 


+ 1 






380 






-1 


-1 


+ 1 


+ 1 


+ 1 


-1 


-1 






37 






-1 


-1 


+ 1 


-1 


-1 


+ 1 


+ 1 






36 






-1 


-1 


-1 


+ 1 


+ 1 


+ 1 


+ 1 






212 




V 


-1 


-1 


-1 


-1 


-1 


-1 


"I J 




V 


52 


/ 



In this paper, we consider designs of p factors with two-level. We write the observations 
as y = {vii ■ ■ ■ > Vk)', where k is the run size and ' denotes the transpose. Write the design 
matrix D = (<%), where dij G { — 1, 1} is the level of the j-th factor in the z-th run for 



l,...,k,j 



.p. 



In this case it is natural to consider the Poisson distribution as the sampling model, 
in the framework of generalized linear models ([H]). The observations y are realizations 
from k Poisson random variables Y\, . . . , Y k , which are mutually independently distributed 
with the mean parameter /ij = E(Yj),i = 1, . . . , k. We call the log-linear model written 
by 



logyUi = p + fiida H h /3 p d, 



k 



(2) 



as the main effect model in this paper. The equivalent model in the matrix form is 



/ log //i \ 



MP, 



where P = (Pq, Pi, ... , P p )' and 



M 



( 1 



V 



D 



(3) 



/ 



We call the k X (p + 1) matrix M a model matrix of the main effect model. The interpre- 
tation of the parameter Pj in ([2]) is the parameter contrast for the main effect of the j-th 
factor. Following the arguments of [5], we can also consider the models including various 
interaction effects. In this paper, we first describe our methods for the main effect models 
and will consider how to treat interaction effects afterward. 



4 



To judge the fitting of the main effect model we can perform various goodness- 
of-fit tests. In the goodness-of-fit tests, the main effect model ([2]) is treated as the null 
model, whereas the saturated model is treated as the alternative model. Under the null 
model ([2]), (3 is the nuisance parameter and the sufficient statistic for j3 is given by M'y = 
(Yli=iyiiYli=idiiyi, ■ ■ ■ ,Yli=idipyi)' ■ Then the conditional distribution of y given the 
sufficient statistics is written as 

k 

f( y I M'y = M'y°) = — ^ TT i-, (4) 

y y ' C{M'y°) f \ Vi V K J 

where y° is the observation count vector and C(M'y°) is the normalizing constant deter- 
mined from M'y° written as 

y&T{M'y°) \i=l yi ' J 

and 

J- {M'y ) = {y | M'y = M'y , yi is a nonnegative integer for i — 1, . . . , k}. (6) 

Note that by sufficiency the conditional distribution does not depend on the values of the 
nuisance parameters. 

In this paper we consider various goodness-of-fit tests based on the conditional dis- 
tribution (jl]). There are several ways to choose the test statistics. For example, the 
likelihood ratio statistic 

k 

T(y) = G 2 (y) = 2j>log^ (7) 

is frequently used, where fii is the maximum likelihood estimate for /ij under the null 
model (i.e., fitted value). Note that the traditional asymptotic test evaluates the upper 
probability for the observed value T{y°) based on the asymptotic distribution xt-p-v 
However, since the fitting of the asymptotic approximation may be sometimes poor, we 
consider Markov chain Monte Carlo methods to evaluate the p values. Using the condi- 
tional distribution (JJJ), the exact p value is written as 

P = E fa I = M'y°)l{T{y) > T(y°)), (8) 

where 

1, ifr(y)>r(y») 



KT(y) > T(y°)) = | ; oth -.- « " (9) 

Of course, if we can calculate the exact p value of ([S]) and (jUj), it is best. Unfortunately, 
however, an enumeration of all the elements in J-^M'y ) and hence the calculation of 
the normalizing constant C{M'y°) is usually computationally infeasible for large sample 
space. Instead, we consider a Markov chain Monte Carlo method. Note that, as one of 
the important advantages of Markov chain Monte Carlo method, we need not calculate 
the normalizing constant §5§ to evaluate p values. 
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To perform the Markov chain Monte Carlo procedure, we have to construct a con- 
nected, aperiodic and reversible Markov chain over the conditional sample space ([6]) with 
the stationary distribution fl4]). If such a chain is constructed, we can sample from the 
chain as . . . , y^ T ^ after discarding some initial burn-in steps, and evaluate p values as 



p=^l(T(y( f ))>T(y )). 



t=i 



Such a chain can be constructed easily by Markov basis. Once a Markov basis is calculated, 
we can construct a connected, aperiodic and reversible Markov chain over the space (Q, 
which can be modified so that the stationary distribution is the conditional distribution 
fllj) by the Metropolis-Hastings procedure. See [7J and [T5] for details. 

Markov basis is characterized algebraically as follows. Write indeterminates x±, . . . ,Xk 
and consider polynomial ring K[xi, . . . , x^] for some field K. Consider the integer kernel 
of the transpose of the model matrix M, Ker^M'. For each b = (61, . . . , bk)' e Ker^M', 
define binomial in K [xi, .... Xk] as 



■ II II 



bi TT -bj 



bj>0 bj<0 

Then the binomial ideal in K[xi, . . . , X}~], 

I(M') = ({f h I b G Ker z M'}> , 

is called a toric ideal with the configuration M' . Let {f^m, ■ ■ ■ , fh(")} be any generating 
set of I(M'). Then the set of integer vectors {b^, . . . , b^ s ^} constitutes a Markov basis. 
See [7] for detail. To compute a Markov basis for given configuration M', we can rely on 
various algebraic softwares such as 4ti2 (PQ). See the following example. 



Example 1.2 (Wave-soldering experiment, continued). We analyze the data in Table [LT1 
The fitted value under the main effect model is calculated as 

A = (68.87, 19.70, 78.85, 147.59, 12.14, 54.77, 104.53, 54.54, 
75.31, 39.29, 75.00, 338.37, 27.83, 52.09, 208.47, 59.64)'. 

Then the likelihood ratio for the observed data is calculated as T(y°) = G 2 (y°) = 117.81 
and the corresponding asymptotic p value is less than 0.0001 from the asymptotic distribu- 
tion xt- This result tells us that the null hypothesis is highly significant and is rejected, 
i.e., the existence of some interaction effects is suggested. To evaluate the p value by 
Markov chain Monte Carlo method, we have to calculate a Markov basis first. If we use 
4ti2, we prepare the data file (configuration M') as 

8 16 

1111111111111111 
1 1 1 1 1 1 1 1-1-1-1-1-1-1-1-1 
1 1 1 1-1-1-1-1 1 1 1 1-1-1-1-1 
1 1-1-1 1 1-1-1 1 1-1-1 1 1-1-1 
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1-1 1-1 1-1 1-1 1-1 1-1 1-1 1-1 

1-1 1-1-1 1-1 1-1 1-1 1 1-1 1-1 

1-1-1 1 1-1-1 1-1 1 1-1-1 1 1-1 

1-1-1 1-1 1 1-1 1-1-1 1-1 1 1-1 

and run the command markov. Then we have a minimal Markov basis with 77 elements 
as follows. 



77 16 



























1 


1 


-1 


-1 


-1 


-1 


1 


1 

















1 


-1 





1 








-1 


-1 


-1 


1 


1 

















1 





-1 





1 





-1 


-1 


-1 


1 


1 














1 





-1 





1 





-1 





-1 


-1 


1 


1 














1 








-1 





1 


-1 





-1 


-1 


1 


1 














1 


1 


-1 


-1 














-1 


-1 


1 


1 











1 








-1 





1 





-1 


-1 





-1 


1 


1 



Using this Markov basis, we can evaluate p value by Markov chain Monte Carlo method. 
After 50, 000 burn-in-steps from y° itself as the initial state, we sample 100, 000 Monte 
Carlo sample by Metropolis-Hasting algorithm, which yields p = 0.0000 again. Figure 
[T]is a histogram of the Monte Carlo sampling of the likelihood ratio statistic under the 
main effect model, along with the corresponding asymptotic distribution xt- 




Figure 1: Asymptotic and Monte Carlo estimated distribution of likelihood ratio 



If the fitting of the main effect model is poor, or we have some prior knowledge on 
the existence of interaction effects, we consider the models including interaction effects. 
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The models including the interaction effects are also described by the model matrix. 
The modeling method presented in [5] is as follows. If we want to consider the models 
including interaction effects, add columns to the model matrix of the main effect model 
(J3J) so that the corresponding parameter can be interpreted as the parameter contrast for 
the additional interaction effect. We describe this point by the previous example. See [5] 
for details. 

Example 1.3 (Wave-soldering experiment, continued). As is pointed out in [14], the 
existence of some interaction effects is suggested for the data in Table 11.11 In [14] , the 
model including 2 two-factor interaction effects, A x C and B x D is considered. Here 
we call this model as Mi model. The model matrix of the Mi model is constructed by 
adding two columns, 

( \ 1-1-1 11-1-1-1-11 1-1-1 1 1 V 

V 1 -1 1-1-11-1 1 1-11-1-1 1 -1 1 ) 

to the model matrix of the main effect model. Note that the above two columns are the 
element-wise product of the (first, third) columns and the (second, fourth) columns of D, 
respectively. Using this model matrix (or configuration matrix as its transpose), we can 
investigate the fitting of the Mi model in the similar way. The fitted value under the Mi 
model is 

pi = (64.53, 47.25, 53.15, 151.08, 30.43, 46.79, 115.24, 32.53, 
49.42, 46.13, 70.90, 360.54, 35.19, 30.26, 232.14, 51.42)'. 

Then the likelihood ratio for the observed data is calculated as T(y°) = G 2 (y°) = 19.0927 
and the corresponding asymptotic p value is 0.00401 from the asymptotic distribution 
Xq- Though this result still suggests the significantly poor fitting of the M 2 model, much 
larger p value and much smaller likelihood ratio than those of the main effect model tell 
us that M 2 model is much better than the main effect model. We have the Markov chain 
Monte Carlo estimate of p value as p = 0.0031 from 100, 000 Monte Cairo sample after 
50, 000 burn-in-steps. 

An important point here is that the model matrix for the models with interaction 
effects constructed in this way is equivalent to the model matrix for the main effect model 
for some regular fractional factorial design of resolution III or more. For example, the 
model matrix for the Mi model in Example 11.31 is equivalent to the main effect model for 
the 2 9-5 fractional factorial design defined from 

ABDE = ACDF = BCDG = ACH = BDJ = I. 

This relation is given by adding H = AC and J = BD to (OQ), as if there exists two 
additional factors H and J in Table 11.11 In this paper, we consider the relation between 
the model matrix and the cut ideals. From the above considerations, we can restrict our 
attentions to the main effect models to consider the relations to the cut ideals. We give 
the relation in Section 12.21 We will consider the models including interaction effects again 
in Section [2731 to consider the relation from the practical viewpoint. 
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2 Two-level regular fractional factorial designs and 
cut ideals 

In this section, we show that a cut ideal for a finite connected graph can be characterized 
as the toric ideal I(M') for a model matrix of the main effect model for some regular 
two-level fractional factorial designs. 

2.1 Cut ideals 

We start with the definition of the cut ideal. Consider a connected finite graph G = (V, E). 
We also consider unordered partitions A\B of the vertex set V. Let V(V) be the set of 
the unordered partitions of V, i.e., 

V(V) = {A\B | A U B = V, A n B = 0}. 

We introduce the sets of indeterminates | {i, j} G E}, {Uj j} G E} and {(Ja\b \ A\B G 
V{V)}. Let 

K[q) = K[q AlB | A\B G V(V)), 

K[s,t) = K[ Sij ,t i:j | {i,j} G £] 

be polynomial rings over a field i^. For each partition A\B G V(V), we define a subset 
Cut(A\B) of the edge set E as 

Cut(A|5) = {{i, j} e E \ i e A,j e B or i e B,j e A}. 

Define homomorphism of polynomial rings as 

<p G ■ K[q\ ^ K[s,t), Qa\b 1 y 1 | ■ I | ty. (10) 

{ij}eCut(A|B) {i,j}GS\Cut(A|_B) 

We may think of s and t as abbreviations for "separated" and "together", respectively. 
Then the cut ideal of the graph G is defined as Iq = Ker(0c). We also use the following 
two examples given in [21j. 

Example 2.1 (Complete graph on four vertices). Let G = K 4 be the complete graph on 
four vertices V = {1, 2, 3, 4}. Then the edge set is E = {12, 13, 14, 23, 24, 34}. The map 
(pK 4 is specified by 

<?0|1234 | — > ^12^13^14^23^24^34 
9l|234 l— * s 12Sl3Si4t23^24^34 
<?2|134 l— * S 12^13^14"S23 S 24^34 
<?3|124 l— * £l2Sl3£l4<S23^24"S34 
?4|123 ^ ^12^13'5l4^23'S24S34 
9l2|34 l— ^ ^12-5l3Si4S23-S24^34 
9l3|24 ^ s 12^13' s 14'523^24'534 
9l4|23 ^ s 12'5l3tl4^23 s 24'S34- 

In this case, the cut ideal is a principal ideal given by 

Ik* — (O'0|12345l2|34?13|249l4|23 — Q , 1|234Q , 2|134^3|124Q , 4|123 ) ■ 
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Example 2.2 (4-cycle). Let G = C 4 be the 4-cycle with 

V = {1, 2, 3, 4}, E = {12, 23, 34, 14}. 
The map 4>c 4 is derived from in Example 12.11 by setting 

s 13 — ^13 — s 24 — ^24 — 1 

as 

9011234 | — > ^12^14^23^34 



I— > S\2 S 14^23^34 
H- >■ S 12 tl4S23t34 
' ^ ^12^14'§23' s 34 
I— >■ tl2Sl4^23 s 34 
^12'5l4'S23^34 
S12S14S23S34 
■Sl2^14^23'S34- 



I — )■ 

1 — y 



9l|234 
92 1 134 
93|124 
94|123 
9l2|34 
9l3|24 
9l4|23 

In this case, the cut ideal is given by 

Ic 4 = (90|12349l3|24 — 9l|23493|124, 90|12349l3|24 — 134^4| 123 5 90|12349l3|24 — <2 , 12|34'3 f 14|23 ) • 

Now we relates the cut ideals to the regular two-level fractional factorial designs. We 
express the map <\>q by 2' y ' _1 x 2\E\ matrix H = {/iA|B,e} where each row of H represents 
A\B G V(V) and each two columns of H represents E as 

(1,0) if e G E\Cut(A\B) 
(0,1) if e G CvX{A\B). 

Note that there are 1^(^)1 = 2' v '+ 1 unordered partitions of V. We also see that each 
two columns of H correspond to t and s. Then the cut ideal, the kernel of 4>g of (flOl) - is 
written as the toric ideal of the configuration matrix H'. 

Example 2.3 (4-cycle, continued). For the case of G = C4 of Example 12.21 the matrix 
H can be written as follows. 



h A 



B,e 





tl2 


S12 


tl4 




£23 


S23 


£34 


S34 


90|1234 


1 





1 





1 





1 





93|124 


1 





1 








1 





1 


94|123 


1 








1 


1 








1 


9l2|34 


1 








1 





1 


1 





9l4|23 





1 


1 





1 








1 


92 1 134 





1 


1 








1 


1 





911234 





1 





1 


1 





1 





9l3|24 





1 





1 





1 





1 



11) 



The kernel of H' coincides to the kernel of M' of ([3]) for the two- level design D of \E\ 
factors with 2' v/ ' _1 runs, where the level of the factor X e for the run A\B G V(V) is given 
by the following map: 

X e : V(V) {+1,-1} 

, , +1 ifee^\Gut(il|S) (12) 
1 1 7 " -1 if e G Cut{A\B) 
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Example 2.4 (4-cycle, continued). For the case of G = the map X e of ( fl2|) gives the 
design matrix D as follows. 





X\2 


-Xl4 


X23 


-X34 


3011234 


1 


1 


1 


1 


?3|124 


1 


1 


-1 


-1 


94|123 


1 


-1 


1 


-1 


?12|34 


1 


-1 


-1 


1 


9l4|23 


-1 


1 


1 


-1 


921134 


-1 


1 


-1 


1 


9l|234 


-1 


-1 


1 


1 


9l3|24 


-1 


-1 


-1 


-1 



For this D, it is easily seen that Ker(M') coincides to Ker(if' ) if H is given by (TlTj) . 



2.2 Regular designs and cut ideals 

In Example 12.44 we obtain the toric ideal for the main effect model of the regular two- 
level fractional factorial designs defined by X12X14X23X34 = 1 from the the cut ideal of 
G = C4. In fact, there is a clear relation between finite connected graphs G and regular 
two- level designs D . As we have seen in Example I2.4[ the cut ideal for G can be related 
to the design of p = \E\ factors with k = 2' v ''~ 1 runs. Since each factor of this design 
corresponds to the edge E of G, we write each factor Xij for {i, j} G E. Since there are 
2 P runs in the full factorial design of p factors, the design obtained from G by the relation 
ffl2|) is a 2' y '~ 1_:P fraction of the full factorial design of p factors. We show this fraction is 
specified as the regular fractional factorial designs. 

Let G = (V, E) be a finite connected graph with the edge set E = {ei, . . . , e p }. Then, 
the cycle space C{G) of G is a subspace of ¥ 2 spanned by 

{e n H h e ir e Fjf 1 (e h , . . . , e ir ) is a cycle of G j , 

where ej is the jth coordinate vector of fJj . On the other hand, the cut space C*(G) of 
G is a subspace of ¥ 2 defined by 



C*(G) ={ Yl e > E E 

ejGCut(A|B) 



2 



A\B E V(V) 



Fix a spanning tree T of G. For each e G E \ T, the set T U {e} has exactly one cycle 
C e of G. Such a cycle C e is called a fundamental cycle of G. Since T has |V| — 1 edges, 
there are \E\ — \V\ + 1 edges in E \ T. It then follows that there exists \E\ — \V\ + 1 
fundamental cycles in G. The following proposition is known in graph theory [8]: 

Proposition 2.5. Let G = (V, E) be a finite connected graph. Then, we have the follow- 
ing: 



C*{G) = (= (veFjf 1 v u = for all u G 
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(ii) Given spanning tree of G, the cycle space C(G) is spanned by 

| + • • • + e ir G F^' , . . . , e* r ) is a fundamental cycle of Gj ; 

(iii) dimC(G) = |£| -|^| + 1 and dimC*(G) = |V| - 1. 



By Proposition 12. 5[ we have the following: 

Theorem 2.6. Let G = (V, E) be a finite connected graph and let D be the design matrix 
of \E\ factors with 2' v ''~ 1 runs defined by [W\l . Then D is a regular fractional factorial 
design with all relations 

X en (A\B)X ei2 (A\B) ■ ■ ■ X ei jA\B) = 1, (13) 
where (e^, . . . , e im ) is a fundamental cycle of G. 
Proof. The equation (JT3"|) is equivalent to the equation 




(e n + --- + e,J- | > j e 3 | =0 

in F^'. Thus, the assertion follows from (i) and (ii) of Proposition 12.51 □ 

It may be a helpful to see a typical example. 
Example 2.7. Consider G = (V, E) given by 

V = {1, 2, 3, 4, 5, 6}, E = {12, 13, 23, 34, 45, 56, 26}. 

,5 




From ( I13p . we see that G corresponds to 2 7 3 design with 

-^12-^13-^23 = -^13-^34-^45-^15 = -^12-^26-^56-^15 = 1- 

Note that the relations corresponding the dependent cycles such as {23, 34, 45, 56, 26} can 
be derived as 

-^23-^34-^45-^56-^26 = (-^12-^13-^23) (-^13-^34-^45-^15) (-^12-^26-^56-^15) = 1- 

Theorem 12.61 shows the relation of the cut ideals and regular two-level fractional fac- 
torial designs. For a given connected finite graph, we can consider corresponding regular 
two-level fractional factorial designs from Theorem 12.61 Unfortunately, however, the con- 
verse does not always hold. For given regular two-level fractional factorial designs (strictly, 
we should say that "for given designs and models" , which we consider in Section [273]) . it 
does not always exist corresponding connected finite graphs. 
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Proposition 2.8. If a TP q design corresponds to a finite graph by the relation (fT^j . then 
we have p < ( p ' q 2 +1 ) 

Proof. Corresponding connected graphs G = (V,E) does not exist because \E\ = p and 
\V\ = p — q + 1 must be satisfied if it exists. (There are ( p ~2 +1 ) edges in K p - q+ i.) □ 

Thus, obvious counterexamples for the converse are given since some regular 2 p ~ q 
designs satisfy ( p- n + ) < P (^ or example, {Pi O) — (5, 3), (5, 4), (6, 4), (6, 5) and so on). On 
the other hand, a necessary condition related with the resolution is as follows. 

Proposition 2.9. If a 2 p ~ q design of resolution IV or more corresponds to a finite graph 
by the relation (fTDj). then we have p < |_(p — Q + 1) 2 /4j . 

Proof. Mantel's theorem in graph theory says that the number of edges in triangle- free 
graph with n vertices is at most [n 2 /4j. □ 

If the resolution of a design is V or more, then similar results are obtained by the 
results in [13] . From these considerations, an important question arises. 

Question 2.10. Characterize regular two-level fractional factorial designs that can cor- 
respond to a finite graph by the relation ( [0| ). 

A complete answer to this question is not yet obtained at present. We give results for 8 
runs and 16 runs designs in Section [2T3l We present several fundamental characterizations 
in the rest of this section. Note that the above correspondence is not one-to-one even if it 
exists. In fact, for any finite connected graph G, we can specify a design D uniquely by 
( [TBI . However, for a given design G, we can consider several graphs satisfying the relation 
(fl~3]) if it exists. 

Example 2.11 (2 5_1 design with X12A13A23 = 1 of 5 factors). Consider 2 5_1 fractional 
factorial design X 12 X 13 X 2 3 = 1 of 5 factors, or, ABC = I in the convention of designed 
experiment literature. There are several corresponding graphs that give this design such 
as follows. 

2 4 5 2 4 




Later, we will be able to understand this by Proposition 13.11 (Both of two graphs are 
0-sum of the same pair of graphs.) 

Now we show two important special cases, designs corresponding to complete graphs 
and trees. 

Example 2.12 (Complete graph on four vertices (continued)). Let G = K 4 be the 
complete graph on four vertices V = {1, 2, 3, 4} in Example 12.11 
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From (|13p . we have the following regular 2 6 3 fractional factorial design. 





X12 


X\3 


Xl4 


A 2 3 


A 2 4 


A34 


70|1234 


1 


1 


1 


1 


1 


1 


93|124 


1 


-1 


1 


-1 


1 


-1 


?4|123 


1 


1 


-1 


1 


-1 


-1 


9l2|34 


1 


-1 


-1 


-1 


-1 


1 


9l4|23 


-1 


-1 


1 


1 


-1 


-1 


?2|134 


-1 


1 


1 


-1 


-1 


1 


9l|234 


-1 


-1 


-1 


1 


1 


1 


9l3|24 


-1 


1 


-1 


-1 


1 


-1 



The defining relation of this design is 

Xi 2 Xi 3 X 23 — X12XUX2A = X13X14X34 = 1, 

where the three terms Xi 2 Xi 3 X 2 3, Xi 2 Xi 4 X 24 , X13X14X34 correspond to the independent 
cycle of K4. 

Generalizing Example 12.121 we summarize the following important cases. 

Corollary 2.13. Let G = K n be the complete graph on \V\ = n vertices. Then, G is 
specified as the regular 2 Cl ~ c ' 2 fractional factorial design of c\ two-level factors by ( TJgj) . 
where 




The defining relation of this design is written as XuXxjXij = 1 for any pair (i,j) with 
2 < i < j < n. 

Another important case is as follows. 

Corollary 2.14. Any spanning tree G = (V,E) is specified as the full factorial design of 
\V\ — 1 two-level factors by ^2\)■ 



2.3 Models for the designs with 8 runs and 16 runs 

It seems very difficult to answer Question 12.101 in general. As special cases, we show that 
2 P_1 and 2 P ~ 2 fractional factorial designs can relate to graphs from algebraic theories in 
Section [31 As another approach, we investigate all the practical models arising in the 
two-level fractional factorial designs with 8 runs and 16 runs. In Section [273| we consider 
models with interaction effects to classify the cases arising in applications. 

First we consider models for the designs with 8 runs. Among the regular designs with 
8 runs, the most frequently used designs are listed in Table [3 We ignore 2j r 7 4 design in 
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Table 2: 2 P q fractional factorial designs with 8 runs (p — q = 3) 



Number of factors p 


Resolution 


Desi; 


5n Generators 






4 


IV 


D = 


ABC 






5 


III 


D = 


AB, E = AC 






6 


III 


D = 


AB, E = AC, 


F 


= BC 


7 


III 


D = 


AB, E = AC, 


F 


= BC,G = ABC 



Table [2] since the main effect model is saturated and cannot be tested in our method. For 
the other 3 designs, 2jy , 2^ 2 and 2^7 3 designs, we consider models to be tested. 
For 2^ design of ABCD = I, we can consider 3 models as follows. 

• Main effect model: 

We write it as A/B/C/D. 

• Models with all the main effects and 1 two-factor interaction effect: 

Without loss of generality, we consider A x B as the interaction effect. We write it 
as AB/C/D. 

• Models with all the main effects and 2 two-factor interaction effects: 

Note that we cannot consider models including, say, A x B and C x D, because 
they are confounded. Without loss of generality, we consider A x B and A x C as 
the two interaction effects. We write it as AB/AC/D. 

Note that we cannot consider models with more than 2 two-factor interactions because 
there are 8 parameters in the saturated models for the design with 8 runs. For these 
models, we can construct model matrix and consider existence of corresponding graphs. 
As we have seen, the model A/B / C /D can relate to the 4-cycle. For the model AB / C /D, 
we introduce imaginary factor E = AB and consider the main effect model for the 2 5-2 
design defined by 

ABCD = ABE (= CDE) = I 

as we have seen in the last of Section [TJ This corresponds to the graph with 5 edges as 
follows. 



C 




B 



Similarly, for the model AB/AC/D, introducing imaginary factors E = AB and F = AC 
and consider the main effect model for the 2 6-3 design defined by 

ABCD = ABE = ACF = I, 

we see the corresponding graph is K±. 
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Table 3: Models for the designs in Table [2] 



Design 


Num. of parameters 


Model 


Index 


Z IV 


5 


A/B/C/D 


[1] 




6 


AB/C/D 


[2] 




7 


AB/AC/D 


[3] 


Z III 


6 


A/B/C/D/E 


[4] 




7 


A/BC/D/E 


[5] 


9 6-3 
Z III 


7 


A/B/C/D/E/F 


[6] 



For 2 I 5 I 7 2 design of D = AB, E = AC, we can consider only one additional interac- 
tion effect to the main effect model. Therefore there are the following 2 models to be 
considered. 

• Main effect model: 

We write it as A/B/C/D/E. 

• Models with all the main effects and 1 two-factor interaction effect: 

Note that we cannot consider models including, say, A x B, because it is confounded 
to the main effect of D. Without loss of generality, we consider B x C as the 
interaction effect, with we write A/BC/D/E. 

For these models, we can construct model matrix and consider existence of corresponding 
graphs. The model A/B/C/D/E can relate to the graph as follows. 



C 




We see the model A/BC/D/E can relate to K4 by introducing imaginary factor F = BC. 

Finally, for 2 6 m 3 design of D = AB, E = AC, F = BC, only the main effect model 
can be considered, which can be relate to K±. 

From the above considerations, we have complete classification of all the hierarchical 
models for the designs with 8 runs in Table [21 We summarize the results in Table [3] and 
Table H 

Next we consider models for the designs with 16 runs in a similar way. The most 
frequently used designs with 16 runs are given Section 4 of [22], which we show in Table 
For the designs in Tabled we consider possible models. For the designs with 16 runs, 
we can test models with less than 16 parameters. However, the models with parameters 
more than 11 (= 10 + 1) cannot relate to graphs obviously because there are 10 edges in 
K5. Therefore we only consider models with at most 11 parameters. 

Moreover, we can use the fact that if some model has a corresponding graph, its each 
submodel also has a corresponding graph. We can confirm this fact by tracing the argu- 
ments of imaginary factors reversely. Suppose some model M. with the interaction factor 
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Table 4: Graphs corresponding to the models in Table [3] 
Graph Models 




Table 5: 2 P q fractional factorial designs with 16 runs (p — q = 4) 
Number of factors p Resolution Design Generators 



5 


V 


E = 


ABCD 






6 


IV 


E = 


ABC, F = 


ABD 




7 


IV 


E = 


ABC, F = 


ABD, G = 


ACD 


8 


IV 


E = 


ABC, F = 


ABD, G = 


ACD 






H = 


BCD 






9 


III 


E = 


ABC, F = 


ABD, G = 


ACD 






H = 


BCD, J = 


ABCD 




10 


III 


E = 


ABC, F = 


ABD, G = 


ACD 






H = 


BCD, J = 


ABCD, K 


= CD 



Ai x • • • x A s has a corresponding graph G with q dimensional cycle space. The graph G 
is constructed by introducing imaginary factor X to consider the cycle {X, A x . . . , A s }. 
It then follows that there exists a spanning tree of G such that {X, Ai . . . , A s } is a fun- 
damental cycle. Then, deleting the edge X yields a graph with q — 1 dimensional cycle 
space, which is the corresponding graph of the model Ai \ {A x x ■ ■ • x A s }. In particular, 
if the main effect model does not have a corresponding graph, all models with interac- 
tion effects also do not have a corresponding graph for this design. For the designs of 
Table [5j we see that the main effect models for 2[y 3 , 2jy 4 , 2^ 5 and 2j I ° I ~ 6 do not have 
corresponding graphs. (For example, if a 2 p ~ q = 2 4 design of resolution IV corresponds 
to a finite graph by the relation ( II 2p . then we have p < [(4 + 1) 2 /4J = 6 by Proposition 
12.91 ) Therefore we consider the models for 2 V _1 and 2jy 2 designs. The distinct models 
for these designs are given in Table [6] and Table [71 For the models with no correspond- 
ing graphs, only minimal models are included in Table [6] and Table UJ For example, the 
model AB/AC/AD/E of the 2 v _1 -design does not have a corresponding graph. This is 
a minimal model in the sense that any submodel of it, i.e., AB/AC/D/E, AB/C/D/E 
and A/B/C/D/E, has a corresponding graph. From the consideration above, we see all 
models including it, i.e., AB/AC/AD/AE or AB/AC/BC/AD/AE, for example, also 
do not have a corresponding graph. 
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Table 6: Models for the 2^r 1 -design of E = 


ABCD 


iNUiii. 01 parameters 


iviociei 


Index 





A /"R /Fl /TT 


[5-11 


1 


ArJ/U/lJ/rj 


[5-21 


o 
o 


Ad / A v 1 l) 1 Hi 








[5-41 


9 


AB/AC/BD/E 


[5-51 




AB/AC/DE 


[5-6] 




AB/AC/AD/E 


(no graph) 




AB/AC/BC/D/E 


(no graph) 


10 


AB/AC/BD/CE 


[5-7] 




AB/AC/BD/CD 


(no graph) 


11 


AB/AC/BD/CE/DE 


[5-8] 


Table 7: Models for the 2^ 2 -design of E = ABC, F = ABD 


INTnm of ri^r^mptPT*^ 


Model 


Index 


7 


A /"R/C /"D /F,/F 

/ J-> / / J--/ / J—J / 17 


if, ii 

[0-1 J 


8 


AB/C/D/E/F 


[6-21 




AC/B/D/E/F 


[6-3] 


9 


AB/AC/D/E/F 


[6-4] 




AB/CD/E/F 


[6-5] 


10 


AB/AC/AD/E/F 


[6-6] 




AB/AD/BC/E/F 


[6-7] 




AB/AC/CD/E/F 


[6-8] 




AB/AC/DE/F 


[6-9] 




AC/BD/EF 


[6-10] 




AB/AC/AE/D/F 


(no graph) 




AB/AC/BC/D/E/F 


(no graph) 


11 


AB /AC /AD /CD/E/F 


[6-11] 




AD/DE/DF/BC 


[6-12] 




AD/BC/CF/DF/E 


[6-13] 




AC/AD/CD/CF/B/E 


(no graph) 




AB/AC/CF/CD/E 


(no graph) 




AC/BD/CD/CF/E 


(no graph) 




AC/BC/AD/AF/E 


(no graph) 




AB/AC/AD/CF/E 


(no graph) 




AC/AD/BC/DF/E 


(no graph) 




AB/BC/AD/EF 


(no graph) 




AD/BC/BD/EF 


(no graph) 




AD/BC/BE/DF 


(no graph) 




AD/AF/BC/BE 


(no graph) 
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For these models, we can construct model matrix and consider existence of correspond- 
ing graphs. The results are shown in Table EJ 

3 Application 

In this section, we apply known results on cut ideals to the regular two-level fractional 
factorial designs. First we study fundamental facts on cut ideals appearing in [21]. Let 
G\ = (V\, Ei) and G 2 = (V 2 , E 2 ) be graphs such that V\ fl V 2 is a clique of both graphs. 
The new graph G = GyZG 2 with the vertex set V — V\ U Vi and edge set E = E% U E 2 
is called k-sum of G\ and G 2 along V\ fl V 2 if the cardinality of V\ fl V 2 is k + 1. In this 
section, we only consider 0,1, 2-sums. Let 

d d 

* =n<iMBi-n<ic i \D i 

8=1 1=1 

be a binomial in I Gl of degree d. Since V\ fl V 2 is a clique of Gi and | Vi fl V2I < 3, we may 
assume that Ai fl Vi fl V 2 — Cj fl V\ fl for all i. For any ordered list -EF of d partitions 
ofV 2 \Vi, 

EF = (E l \F 1 ,E 2 \F 2 , . . . , -Edl-^d), 
we define the binomial in Iq of degree d by 

d d 
8=1 i=l 

If F is a set of binomials in Iq x , then we define 

Lift(F) = I f SF I f G F, = {EilF^A . 
On the other hand, let Quad(Gi, G 2 ) be the set of all quadratic binomials 

<?AUC*i UC 2 IflUDj UZ?2 QAVJE 1 UE 2 \BUFx UF 2 — QA\JE 1 UC 2 \BUFx UD 2 QAUd UE 2 \BUDiUF 2 

where 

• A I B is an unordered partition of V\ D V2; 

• Ci|Di and Fi|Fi are ordered partitions of V\ \ V 2 ; 

• C 2 \D 2 and F 2 |F 2 are ordered partitions of V 2 \V\. 
Then, the following is known: 

Proposition 3.1 ([21]). Let G = G\'*G> 2 be aO, 1 or 2-sum of G\ and G 2 and let Fj be 

a set of binomial generators of Iq. for i = 1,2. Then Iq is generated by 

M = Lift(Fx) U Lift(F 2 ) U Quad(C 1 , G 2 ). 

Moreover, z/F, is a Grobner basis of Id fori = 1,2, then there exists a monomial order 
such that M is a Grobner basis of Iq- 



19 



Table 8: Graphs corresponding to the models in Tables |6] and [7] 



Graph Models 
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Example 3.2. The graph Gg in Table |8] is a 1-sum of the complete graph K4 and a cycle 
C3 of length 3. Since Ik 4 is a principal ideal in Example 12.11 and Io 3 is a zero ideal, Ia g is 
generated by the binomials of degree 2 and 4. On the other hand, the graph do in Table 
[H]is a 2-sum of the complete graphs K± and K4. Thus, Ia 10 is generated by the binomials 
of degree 2 and 4, too. 

A graph G is called a ring graph if G is obtained by 0/1-sums of cycles and edges. It 
is known that ring graphs have no K4 minor. 

Proposition 3.3 ([18]). If G is a ring graph, then Iq has a quadratic Grobner basis. 

Example 3.4. It is easy to see that the graph Gi in Table [8] is a ring graph if and only 
if i G {1, 2, 5, 6}. Thus the cut ideal has a quadratic Grobner basis for % G {1, 2, 5, 6}. 

Let e = {i,j} G E be an edge of a graph G = (V, E). Then, the new graph G \ e :— 
(V, E \ {e}) is called the graph obtained from G by deleting e. On the other hand, the 
new graph G/e obtained by the procedure 

(i) Identify the vertices i and j; 

(ii) Delete the multiple edges that may be created while (i); 

is called the graph obtained from G by contracting e. A graph H is said to be a minor 
of G if it can be obtained from G by a sequence of deletions and/or contractions of edges 
(and deletions of vertices). The following theorem is conjectured by Sturmfels-Sullivant 
[2Tj and proved by Engstrom: 

Proposition 3.5 ( [10J ) . The toric ideal Iq is generated by quadratic binomials if and 
only if G has no K4 minor. 

Example 3.6. It is easy to see that the graph Gi in Table E] does not have K4 minor if 
and only if i G {1,2, 3, 5, 6}. Thus the cut ideal Jg^ is generated by quadratic binomials 
if and only if i G {1, 2, 3, 5, 6}. 

Let G be a graph with vertex set V = {1, . . . , n} and edge set E. The suspension of 
the graph G is the new graph G whose vertex set equals V U {n + 1} and whose edge set 
equals E U {{i, n + 1} | i G V}. It is known (21] that the toric ideal of the binary graph 
model of G equals to the cut ideal Iq of G. 

Proposition 3.7 (p3]). Let G be the suspension of G. Then Iq is generated by binomials 
of degree < 4 if and only if G has no K4 minor. 

Example 3.8. The graph Gg in Table M is the suspension of a cycle C4 of length 4. Since 
C4 has no K4 minor, Iq 8 is generated by the binomials of degree < 4. On the other hand, 
the graph Gn = K 5 in Table M is the suspension of K^. Thus, Io n is not generated by 
the binomials of degree < 4. 

We now apply these known results to our problem. 
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Theorem 3.9. Let D be a regular fractional factorial design with at most two defining 
relations. Then there exists a connected graph G = (V, E) such that 

• D is the design matrix of \E\ factors with 2' v ''~ 1 runs defined by [LW . 

• Ig is generated by quadratic binomials. 

Moreover, if D has exactly one defining relations, then Iq has a quadratic Grobner basis. 
Proof. Let D be a regular 2 P_1 fractional factorial design with the defining relation 

Ai A 2 A r = I. 

Let G = (V, E) be a graph on the vertex set V — {1,2, ... ,p} with the edge set 

E = {12, 23, . . . , (r - l)r, lr, r(r + 1), (r + l)(r + 2), . . . , (p - l)p}. 

Then, it is easy to see that D is the design matrix of \E\ factors with 2' y ' _1 runs defined 
by (jT2"j) . Since G is a ring graph, I G has a quadratic Grobner basis by Proposition 13.31 
Let D be a regular 2 P ~ 2 fractional factorial design with the defining relation 

A]A 2 • • • A r B!B 2 ■ • ■ B s = B!B 2 • ■ ■ B s CiC 2 • • • C t — I, 

where < s G Z and 1 < r, t G Z. Let G = (V, E) be the graph in Figure [2j It then 



r edges 




t edges 



Figure 2: A graph for a regular 2 P 2 fractional factorial design 

follows that D is the design matrix of \E\ factors with 2' v '' _1 runs defined by (fl2"l) . It is 
easy to see that G has no K4 minor. Hence, by virtue of Proposition 13. 5[ Iq is generated 
by quadratic binomials. □ 

Remark 3.10. Explicit description of binomials appearing in Theorem 13.91 is given in 
[18] and [10] . The set of generators of Iq consisting of quadratic binomials for a regular 
2 P_1 fractional factorial design is studied in [2]. 
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